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TELOMERASE 



TEMPLATE 

CCAAAACCCCAAAAC ■ 
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PANEL A 
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1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151" 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 



CCCCAAAACC 
GTAGTTTAGA 
TTTGGATGAT 
GCTCTTGTAG 
TCGCCTTCAT 
CTCGGATGCA 
AAAGCGGAGA 
AATCAGGTAA 
TTATGGAGAA 
CCTAGCCAAC 
GATCTCGATA 
AAAAAGTTTG 
TATGGGTTTT 
TTGAGACAAT 
AGTTCTGATG 
CTCAATTTAA 
TAATGGAATA 
TTGGATTCTT 
TTACAACAGA 
AAAAGAAGCA 
TTGTTGATTC 
AAAGAAGAAG 
AGGTAAGAGA 
CACAGCTGTT 
GTAAAAAGTA 
TCACATTCAT 
GCAGTCATCC 
AAGAAATGGA 
AAGAATCGAA 
AT TG AAG AG A 
GTAACTTTTA 
CGATCTTCAA 
ATACAAACCT 
AAAGAAAAAA 
AAGATTTATT 
GGGGTTTTGG 



CCAAAACCCC 
AATAAAATAT 
ATAGAAAATT 
TGACAAGAAA 
TGACTATTCC 
AATCTTTATA 
GCAAAGAGTA 
TGAGGATTAT 
AATTACTTAA 
AATGATGAGT 
CATCAGACTT 
ATAATCGAAC 
ATTACAATTG 
TGAAAAAGCT 
TGTATGCCAT 
T GG AT AG C T A 
TACGTTAAAT 
AAAGCATAGA 
TTACCTGTTT 
GGCGAAATGA 
TTCTGTAACC 
AG C TAT C AC A 
GAT AC AT T C A 
ATTTTCTTTT 
TCAAATAAGA 
AGATCGACCT 
GTTTTAAAAA 
GCCGAAATCT 
CTCTAAATCT 
TTGACGAGGC 
TTAATTAGAG 
TTGACGAAAT 
TGGTCAAAAT 
TAAGGCAATA 
TTTTTCAATA 
GG 



AAAACCCCTA 
TATTCCCGCA 
TACTTCCTAA 
GGATGCAAAA 
AAAGTTGCAA 
ACGATTCTTT 
GAAAT TGAAA 
TCTATTTTTT 
TACTAAAAGG 
ATATTAAATT 
ACCAAAGACA 
AGCAGAAGAA 
TTTTAGGTAT 
GTTTACAACT 
TATTTTGTGA 
TAGAAACAAA 
CCTTTGGGAC 
TACACAGAAT 
TGATTACTCT 
AAAGAAGACT 
GGAATTAACA 
ATCCTGATTC 
TTAAAATTCA 
ATCTTAACAA 
GAAGCGCTAG 
T CAT AT AT C C 
TAGTGCTATG 
TAATCAAAAA 
TTCGTTAATA 
AACTGCACAG 
AATAAACTAA 
AAAAGCTGAA 
AT TG AGG AAG 
AATAAAATGA 
ATTTATTGAA 



TAAAAAAAGA 
CAAATGGAGA 
TACATTCAAC 
CATTGAAATC 
AAACAATTAG 
CTTGAGAAAA 
CATTACTAAT 
AGATCACTTC 
TAAACAGTTT 
CAT AT G AG AA 
AACTCGCTAT 
CTTATTGCAT 
CGACGGTGAA 
GAAGGAATCG 
ATTAATCTCA 
CCAAATAAAC 
AAATGCACAC 
GCTTTAGAGA 
TGCTCATCTC 
AAAGAAAGAG 
ACAAGAATAT 
TTAAAGATTT 
TATATTATAG 
TATTTTTTGA 
ACTGAGGTAA 
AATACGATGA 
AGGACTAAAT 
GAATTGCGTC 
AGTATTACCA 
AAGATCATTA 
ATTACTAATA 
CTAAAGTTAG 
GAAAAGAAGA 
GTACAGAAGT 
AAGAGGGGTT 



AAAAATTGAG 
TGGATATTGA 
AAGTATAGCA 
TGGCTCGAAA 
AGTTCTACTT 
TTAGTTTTAA 
GTTTAAATAA 
TTAAGGAGCA 
GGATTATTTC 
TGAGTCAAAG 
AAAACGCAAG 
TTACTATTCG 
CTCCCGAGTC 
CAGTTCTGAA 
AATATCTTAT 
CATGCAAGTT 
TGAATTTATA 
CTGATTTAGC 
TTATATCTTT 
ATTTCAAAAT 
TAGCAACGAA 
CAAAAATTCC 
TTTTTCATTT 
TTAGCTGGAA 
CTTAGCTTAT 
TAAGGAAACA 
TTTTAGAGTC 
GATATTGCAA 
ATCTTGATTG 
AAGAAATAAA 
TAG A GAT C AG 
ACAATAAAAA 
CCAGTTAGCA 
GAAGAAATAA 
TTGGGGTTTT 
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1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 

51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAAC CTAA 

101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 

151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 

201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA T T AT AAAG AT 

251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 

3 01 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 

3 51 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 

4 01 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AAT C AAT TAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
6 01 GCGAACTTCT GAAGGAAC T C TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
651 ATCATTTGAA AGT CAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AAT TGG AAT A 
801 AT ATGAAAT C AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC G AAT AT ATT C AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGC CTACATG 

10 01 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 

1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 

1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 

1151 TACACAACTG ATAATAAATG CGT CACACAA TTTATTAATG AATTTTTCTA 

12 01 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 

12 51 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 

13 01 AAAAAC T TAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 

13 51 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGAT CAC GAAAACATCT 

14 01 ACGT CTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 
i2 14 51 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 
^ 15 01 AAC CTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGT CAA 
III 1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 

16 01 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 
1651 AC C GAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 

17 01 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
1801 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 
1851 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 
19 01 AC TAT GG ATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACT AT C 
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2 001 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2 051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AG C AAAG CAA 
22 01 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 

22 51 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 
2 3 01 GAATTCCTCA AGGT CTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 

23 51 GC AAC AT TAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AAT CAATGAA 
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24 01 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 

24 51 TTTTGATTAC AACTCAAGAG AATAATG C AG TATTGTTTAT TGAGAAACTT 

2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 

2 551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 

2601 GTGTTGAGGA GCAAAATATT GTT CAAGATT ACTGCGATTG GATTGGCATC 

2651 TCAATTGATA TGAAAACTCT TGCTTTAATG C CAAATAT T A ACTTGAGAAT 

27 01 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAG AAAG CAT 

2 751 CAATGTGGCT CAAGAAGAAA CTAAAGT CGT TTTTAATGAA TAACATTACC 

2801 CATTATTTTA GAAAGACGAT TACAAC CGAA GACTTTGCGA ATAAAACTCT 

2851 CAACAAGTTA TTTATATCAG GCGGTTACAA AT AC AT G C AA T GAG CC AAAG 

2 901 AATACAAGGA CCACTTTAAG AAGAACTTAG C TAT GAG C AG TATGAT CGAC 

2 951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

3 0 01 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CATTATCCAG 
3 051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAGAAG C AAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
3 2 01 CTATT CTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG T C T TAT ATA C 
3 2 51 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 



FIG. 9 

(CONTINUED) 



1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 

51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

2 01 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKIAYM 

3 01 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH ENIYVLWKLL RWIFEDLWS 
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMS IADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTLIVEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
8 01 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 S I DMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 

1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 
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CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 
1 + + + + + + 6Q 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

a PQNPKTPKPL*KKKKLR*FR 

b PKTPKPQNPYKKRKN*GSLE 

c PKPQNPKTP I KKEKI EVV * K - 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 
61 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

a NKILFPHKWRWILIWMI*KI 

b I KYYSRTNGDGY* F G * YRKF 

C * N I I PAQMEMDIDLDDIENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 
121 + + + + + + i 8 o 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHSTS IAALVVTRKDAK 

b TS *YIQQV*QLL* *QERMQN- 

c LPNTFNKYSSSCSDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLARNRLH*LFQSCKNN* 

b IEIWLEIAFIDYSKVAKTIR- 

C LKSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

a SSTSRMQIFITILS*EN*F* 

b VLLLGCKSL*RFFLEKI SFK- 

c FYFSDANLYNDS FLRKLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 
301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE*KLKHY*CLNKIR* 

b KRRAKSRN*NITNV* I KSGN- 

c SGEQRVEIETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 + + + -- -- + + + 420 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a *GLFYFLDHFLRSIMEKIT* 

b EDYSIF* ITS*GALWRKLLN- 

C RI I LFFRSLLKEHYGENYLI- 
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TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + + + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a. Y*KVNSLDYFPSQQ**VY*I 

b TKR*TVWI ISLANNDEYIKF- 

C LKGKQFGLFP* PTMMS ILNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRISIHQTYQRQTRY 

b I*E*VKGSRYIRLTKDKLAI- 

C YENESKDLDTSDLPKTNSL * - 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 + + + + + + 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKV* *SNSRRTYCIYYS 
!f b KRKKKFDNRTAEELIAFT I R - 

O c NARKS L I I EQQKNLLHLLFV- 

1% TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

Us 501 + + + + + + 660 

W ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

. ~ a YGFYYNCFRYRR* TPES * DN 

ill b MGFITIVLGIDGELPSLETI 

1AJ c WVLLQLF *VSTVNSRVLRQL- 

j"4 TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

"Z 661 + + + + + + 720 

;F" ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

m a *KSCLQLKESQF*KF*CVCH 

]Z b EKAVYN*RNRSSESSDVYAI 

c KKLFTTEG IAVLKVLMCMPL- 

IU 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + - + + + + + 780 

AT AAAAC AC T T AAT T AG AGT T TAT AG AAT AG AG T T AAAT T ACCT AT CG AT AT CTTTGTTT 

a YFVN* SQISYLNLMDSYRNK 

b IL*INLKYLISI*WIAIETN- 

c FCELISNILSQFNG*L* KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNK P CKFNG I YVK S FGTNAH 

b QINHASLMEYTLNPLGQMHT 

C K*TMQV*WNIR* ILWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a * IYIGFLKHRYTECFRD* FS 

b EFILDS* S IDTQNALETDLA- 

C NLYWILKA* IHRML*RL I * L - 



FIG. 12 

(CONTINUED) 



+ 



12/78 



TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

+ + + + + + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

LQQITCFDYSCSSLISLKEA 
YNRLPVLITLAHLLYL* KKQ- 
TTDYLF*LLLLISYI FKRSR- 



GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 

+ + + + + + 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 



GEMKRRLKKE I SKFVDS SVT 
AK*KED*RKRFQNLLILL*P- 
RNEKKTKERDFKIC*FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

+ + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

GINNKNISNEKEEELSQS*F 
ELTTRILATKKKKSYHNPDS - 
N*QQEY*QRKRRRAITI L I L - 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

+ + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

LKISKIPGKRDTFIKIHIL* 
*RFQKFQVREIHSLKFIYYS 
KDFKNSR*ERYIH*NSYI IV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

+ + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

FFISQLLFSFILTIFFD*LE 
FSFHSCYFLLS*QYFLISWK- 
FHFTAV I FFYLNN I F * LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

+ + - + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

VKSIK*EKR*TEVT*LIHIH 
* KVSNKRSARLR* LSLFTFI - 
KKYQIREALD* GNLAYSHS * - 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

+ + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

RSTFIYPIR**GNSSHPF*K 
DRPSYIQYDDKETAVIRFKN- 
IDLHI SNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

+ + + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

* CYED* IFRVKKWSRNLNQK 
SAMRTKFLESRNGAE I L I KK 
VL*GLNF* SQEMEPKS * SKR- 
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GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 + + + + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 

a ELRRYCKRIEL* IFR* *VLP 

b NCVDIAKESNSKSFVNKYYQ- 

c IAS ILQKNRTLNLSLIS I T N - 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 + + + + + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a I LIDCRD*RGNCTEDH*RNK 

b S*LIEEIDEATAQKI IKEIK- 

c LD * LKRLTRQLHRRSLKK* S - 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 

1501 + + + + + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN* ITNIEISDLQ 

b *LLLIRE*TKLLI *RSAIFN- 

c NFY*LENKLNY*YRDQRSS I - 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 + + + + + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK* KLN* S * T I KNTNLGQN 

b *RNKS*TKVRQ*KIQTLVKI 

C DE I KAELKLDNKKYKPWS KY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + + + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEGK'E DQLAKEKIRQ* I K* 

b LRKEKKTS *QKKK*GNK*NE 

c * GRKRRPVS KRKNKAI NKMS- 

GT A C AG AAG T G AAG AAAT AAAAG AT TTATTTTTTT C AAT AAT T TAT T G AAAAG AG G G G T T 

1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK* RNKRFI FFNNLLKRGV 

b YRSEEIKDLFFSIIY*KEGF- 

c TEVKK* KIYFFQ* FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + -- 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
C G V L G F W 
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707 VI . . KNFALQKIG 717 
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ME I ENNQAQQ PKAEKLWWELELEMQENQND I QVRVKIDDPKQYL 
VNVTAACLLQEGSYYQDKDERRYI ITKALLEVAESDPEFICQLA 
VYIRNELYIRTTTNYIVAFCVVHKNTQPFIEKYFNKAVLLPNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKI LGKKYPKTEEE YKAAFGDS AS APFNPELAGKRMKI E I SKTW 
ENELS AKGNTAEVWDNL I S SNQLPYMAMLRNLSNI LKAGVSDTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFKAKKR 
ENMNLKGQIEAVKEVVEKTDEEKKDMELEQTEEGEFVKVNEGIG 
KQYINS I ELAI KI AVNKNLDEI KGHTAI FSDVSGSMSTSMSGGA 
KKYGS VRTCLECALVLGLMVKQRCEKS S FYI FS S PS SQCNKCYL 
EVDL PGDELRP SMQKLLQEKGKLGGGTDF PYEC IDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDEFNENNYIKIFGMSDSILKFISAKQGGANM 
VE V I KN FALQ K I GQ K 

FIG. 20 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQL I KKVGS KVEKDLNLNEDENKKNGLS EQQVKEEQLRT I TEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFEISHDQ 
KNYVSIYANQKTSYCWWLKDYFNKNNYDHLNVSINRLETEAEFY 
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF 
NILNIRSSYTRNQYNFEKIGELLETIFAVVFSHRHLQGIHLQVP 
CEAFQYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKF 
LQEFPRLTHVSQQAI PVS ATNAVENLNVLLKKVKHANLNLVS I P 
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSWLQATQEIYDSLHKLLI 
RS TNLKKFKLS YKYEMEKS KMDTF I DLKNI YETLNNLKRCS VN I 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLiCKSIASCKNLQNVNI IASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF 
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPELNQVYINQ 
QLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDFD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

FIG. 22 



MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPGLPGDLSHKAVIDHCI IYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
N YT V I Q FNGQ FFTQI VGNRCNE PHL P P KW VQRS S S S S AT AAQ I K 
QLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSSIKKLTDLREA 
I F PTNLVKI PQRLKVRINLTLQKLLKRHKRLNYVS I LNS I CPPL 
EGTVLDLSHLSRQSPKERVLKFIIVILQKLLPQEMFGSKKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAICFISWLFRQLIPKI IQTFFYCTEISSTVTIVYFR 
HDTWNKLITPFIVEYFKTYLVENNVCRNHNSYTLSNFNHSKMRI 
IPKKSNNEFRI IAIPCRGADEEEFTIYKENHKNAIQPTQKILEY 
LRNKRPTSFTKIYSPTQIADRIKEFKQRLLKKFNNVLPELYFMK 
FDVKS C YDS I PRMECMR I LKDALKNENGFFVRS QYFFNTNTGVL 
KLFNVVNASRVPKPYELY I DNVRTVHLSNQDV INVVEME I FKT A 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKANRDK 
I L AVS SQ SDDDTVI QFC AMH I F VKELE VWKH S STMNNFH IRSKS 
SKGIFRSLIALFNTRISYKTIDTNLNSTNTVLMQIDHVVKNISE 
CYKSAFKDLSINVTQNMQFHSFLQRIIEMTVSGCPITKCDPLIE 
YE VRFT I LNGFLES LSSNTSKF KDN I I L LRKE I QHLQAY I Y I Y I 
HIVN r-,^ 
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1 tcaatactat taattaataa ataaaaaaaa 
61 ctaaaaaaag ccataggctc ctataggcaa 
121 tctagaagtt tacaaaagcc agattgagca 
181 ggaggatctc aagcttttaa agttcaaaaa 
241 tgatgatgat gaagaaaaca actcaaataa 
3 01 gattaagtag caagtttaat tgataaaaaa 
3 61 tttgaacgaa gatgaaaaca aaaagaatgg 
421 attaagaacg attactgaag aataggttaa 
481 ccagttagat ttaaatgaga gtggtggcca 
541 tactgaaaaa tggtttgaaa tatctcatga 
601 ctaaaagaca tcatattgtt ggtggcttaa 
661 tcttaatgta agcattaaca gactagaaac 
721 ttcacaaaca atcaaactta ctaataattc 
781 tgataataat ctctgtatac tcgcattgct 
841 tattttgaat ataagatctt cttatacaag 
901 gctacttgaa actatcttcg cagttgtctt 
961 acaagttcct tgcgaagcgt tctaatattt 
1021 agatagctaa ttataggtat actctttctc 
1081 agtccaagat tattttaagt tcttataaga 
1141 ggctatccca gttagtgcta ctaacgctgt 
1201 caagcatgct aatcttaatt tagtttctat 
1261 tgttaattta taacatttga aattagagtt 
1321 aaagcttgaa aatctacttt tgagtataaa 
1381 aaacttttac acctacgttg cttaagaaac 
1441 aacaatcaaa aatctcaaaa acaataaaaa 
1501 aactccaagc gaaagcacaa gtggtatgaa 
1561 gcttgaagat ttcagcgtta acttgtaagc 
1621 acttttgatt agatcaacaa atttaaagaa 
1681 aaagagtaaa atggatacat tcatagatct 
1741 taaaagatgc tctgttaata tatcaaatcc 
1801 taaagattct actttttata aatttaagct 
1861 gtatactttt aagtagaacg aattttaatt 
1921 ttcctcatta gaaagcttag aagatattga 
1981 aaatttacaa aatgttaata ttatcgccag 
2041 tcctttcaat aagcccaatc ttctattttt 
2101 aaatgtatct atcaactgta ttcttgatca 
2161 agaaaagaat aaaaaaataa aagcattcat 
2221 tcttgattat actaaattat ttaaaacact 
2281 cattaattag caattagaag aattgactgt 
2341 ccacaagcaa aaagctttct atgaaccatt 
2401 cctttagcta atagattttg accaaaacac 
2461 agaatctata tctgagtcta agtatcatca 
2521 cagtttaatt aaatctgaaa acgaagaaat 
2581 aggtgtttta gtaaaagcat actataaatt 
2641 cgattacaat tcagatagat ggtgattaat 
2701 tgaatatttc tttgcttatt atttgaataa 
2761 atatatttta gttatttaat tcattatttt 
2821 aaaaaatcg 



gcaaactaca aagaaaatgt caaggcgtaa 
tgaaacaaat cttgattttg tattacaaaa 
ttataagacc tagtagtaat agatcaaaga 
ttaagattag gatggaaact ctggcaacga 
ataataagaa ttattaagga gagtcaatta 
agttggttct aaggtagaga aagatttgaa 
actttctgaa tagcaagtga aagaagagta 
gtattaaaat ttagtattta acatggacta 
tagaagacac agaagagaaa cagattatga 
ccaaaaaaat tatgtatcaa tttacgccaa 
agattatttt aataaaaaca attatgatca 
tgaagccgaa ttctatgcct ttgatgattt 
ttactagact gttaacatag acgttaattt 
tagattttta ttatcactag aaagattcaa 
aaattaatat aattttgaga aaattggtga 
ttctcatcgc cacttacaag gcattcattt 
agttaactcc tcatcataaa ttagcgttaa 
tacagactta aaattagttg acactaacaa 
attccctcgt ttgactcatg taagctagta 
agagaacctc aatgttttac ttaaaaaggt 
ccctacctaa ttcaattttg atttctactt 
tggattagaa ccaaatattt tgacaaaaca 
ataatcaaaa aatcttaaat ttttaagatt 
ctccagaaaa cagatattaa aacaagctac 
tcaagaagaa actcctgaaa ctaaagatga 
attttttgat catctttctg aattaaccga 
tacccaagaa atttatgata gcttgcacaa 
gttcaaatta agttacaaat atgaaatgga 
taagaatatt tatgaaacct taaacaatct 
tcatggaaac atttcttatg aactgacaaa 
gaccttaaac taagaattat aacacgctaa 
taataacgtt aaaagtgcaa aaattgaatc 
tagtctttgc aaatctattg cttcttgtaa 
tttgctctat cccaacaata tttagaaaaa 
caagcaattt gaataattga aaaatttgga 
gcatatactt aattctattt cagaattctt 
tttgaaaaga tattatttat tacaatatta 
tcaatagtta cctgaattaa attaagttta 
gagtgaagta cataagtaag tatgggaaaa 
atgtgagttt atcaaagaat catcctaaac 
tgtaagtgat gactctatta aaaagatttt 
ttatttgaga ttgaacccta gttaatctag 
ttaagaactt ctcaaagctt gcgacgaaaa 
ccctctatgt ttaccaactg gtacttatta 
taaatattag tttaaataaa tattaaatat 
tacatacaat agtcattttt agtgttttga 
aagtaaataa ttatttttca atcatttttt 
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Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 

FIG. 24 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AAT G AGGAT TATA C C AAAAAAAAG T AAT AAT GAGTT C AG G ATT AT TG C CAT C C C ATG C AG 
AGGGG CAGACGAAGAAGAATT CAC AAT TT ATAAGGAGAAT C ACAAAAATGC TAT C CAGC C 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACG CTAG C AGAGTAC CAAAAC CTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 



FIG. 26 



22/78 














H 


CM 


00 




N 


H 


og 


J 


<D 


CO 


rH 




4-) 







r-H CN CO 

N Eh cm 

CD CO rH 

U W ft 



I 



h eg n 

N H CN 

<D CO rH 

4J W ft 



H CN n 

N Eh cm 

0) CO rH 

4J W ft 



23/78 



AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFI PKPDGLRP I VNMDYWGARTFRREK 
RAERLTSRVKALFSVLNYERA 

FIG. 27 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 

FIG. 28 



MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTWGF 
DSKPDEGVQFSSPKCSQSELIANWKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRKR 
T I ETS I TQNKSARKEVS WNS I S I SRFSI FYRS SYKKFKQDLY FNLHS I CDRNTVHMWLQW I FPRQFG 
LINAFQVKQLHKVIPLVSQSTWPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEIILKDLETFLKLSRYESFSLHYLMSNIKISEIEWL 
VLGKRSNAKMCLSDFEKRKQIFAEFIYWLYNSFI IPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTN 
QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGS ILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGI INNTFFNESKKRMPFFG 
FSVNMRSLDTLLACPKIDEALFNSTSVELTKHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCN 
IYRLGYSMCMRAQAYLKRMKDIFIPQRMFITDLLNVIGRKIWKKLAEILGYTSRRFLSSAEVKWLFC 
LGMRDGLKPSFKYHPCFEQLIYQFQSLTDLIKPLRPVLRQVLFLHRRIAD 



FIG. 29 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1. Transform with linear fragment 
containing the telomerase gene 
disrupting with a LEU2 or ura4 marker 



M 



Telomerase Gene::M 



Wild Type Telomerase Gene 




Wild Type Telomerase Gene 



Wild Type (X2) 



2. Assay in selective media 



3. Sporulate, and grow on 
selective media 




Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 



FIG. 43 
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heterozygote diploid 
wt diploid 
wt haploid 
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after sporulation 
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FIG. 45 
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i 

met ser val tyr val val glu leu leu 
GC CAAGTT C CTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

7 ° 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 110 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 



160 



170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 

TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 

GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 

ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 
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gly thr ser 
GGC ACG TCC 

220 

gin gly ile 
CAG GGG ATC 



leu cys tyr 
CTG TGC TAC 

250 

arg asp gly 
CGG GAC GGG 



thr pro his 
ACA CCT CAC 

280 

arg gly val 
CGA GGT GTC 



val val asn 
GTG GTG AAC 

310 

phe val gin 
TTT GTT CAG 



leu leu asp 
CTG CTG GAT 

340 

tyr ala arg 
TAT GCC CGG 



phe lys ala 
TTC AAG GCT 

370 

arg leu lys 
CGG CTG AAG 



leu gin thr 
CTC CAG ACG 

400 

ala tyr arg 
GCG TAC AGG 



210 
ala arg pro 
GCA AGG CCT 



pro gin gly 
CCG CAG GGC 

240 

gly asp met 
GGC GAC ATG 



leu leu leu 
CTG CTC CTG 

270 

leu thr his 
CTC ACC CAC 



pro glu tyr 
CCT GAG TAT 

300 

phe pro val 
TTC CCT GTA 



met pro ala 
ATG CCG GCC 

330 

thr arg thr 
ACC CGG ACC 



thr ser ile 
ACC TCC ATC 

360 

gly arg asn 
GGG AGG AAC 



cys his ser 
TGT CAC AGC 

390 

val cys thr 
GTG TGC ACC 



phe his ala 
TTT CAC GCA 



ser arg ala 
TCA AGA GCC 



ser ile leu 
TCC ATC CTC 



glu asn lys 
GAG AAC AAG 



arg leu val 
CGT TTG GTG 



ala lys thr 
GCG AAA ACC 



gly cys val 
GGC TGC GTG 



glu asp glu 
GAA GAC GAG 



his gly leu 
CAC GGC CTA 



leu glu val 
CTG GAG GTG 



arg ala ser 
AGA GCC AGT 



met arg arg 
ATG CGT CGC 



leu phe leu 
CTG TTT CTG 



asn ile tyr 
AAC ATC TAC 



cys val leu 
TGT GTG CTG 



thr ser tyr 
ACG TCC TAC 

230 

ser thr leu 
TCC ACG CTG 



leu phe ala 
CTG TTT GCG 

260 
asp asp phe 
GAT GAT TTC 



phe leu arg 
TTC CTC AGG 

290 
val asn leu 
GTG AAC TTG 



ala leu gly 
GCC CTG GGT 

320 
phe pro trp 
TTC CCC TGG 



gin ser asp 
CAG AGC GAC 

350 
leu thr phe 
CTC ACC TTC 



lys leu phe 
AAA CTC TTT 

380 

asp leu gin 
GAT TTG CAG 



lys ile leu 
AAG ATC CTC 

410 
gin leu pro 
CAG CTC CCA 



val gin cys 
GTC CAG TGC 



leu cys ser 
CTC TGC AGC 



gly ile arg 
GGG ATT CGG 



leu leu val 
TTG TTG GTG 



thr leu val 
ACC CTG GTC 



arg lys thr 
CGG AAG ACA 



gly thr ala 
GGC ACG GCT 



cys gly leu 
TGC GGC CTG 



tyr ser ser 
TAC TCC AGC 



asn arg gly 
AAC CGC GGC 



gly val leu 
GGG GTC TTG 



val asn ser 
GTG AAC AGC 



leu leu gin 
CTG CTG CAG 



phe his gin 
TTT CAT CAG 
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420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430 440 

arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 

GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 
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Motif -1 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



. LVVSLIRCFFYVTEQQKSYSKT . 
. FIIPILQSFFYITESSDLRNRT . 
. LIPKI IQTFFYCTEISSTVTIV . 
. YWELLRSFFYVTETTFQKNRL . 
FFY TE 



Motif 0 
Ep p!23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



K 

p hhh K hR h R 

. KSLGFAPGKLRLIPKKT- -TFRPIMTFNKKIV 
. QKTTLPPAVIRLLPKKN- -TFRLITNLRKRFL 
. TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD 
. ARPALLTSRLRF IPKPD - - GLRP I VNMDYVVG 
R PK R I 



Motif A 
Ep p!23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



AF 

h hDh GY h 
. PKLFFATMDIEKCYDSVNREKLSTFLK. 
. RKKYFVRIDIKSCYDRIKQDLMFRIVK. 
. PELYFMKFDVKSCYDSIPRMECMRILK . 
. PELYFVKVDVTGAYDTIPQDRLTEVIA. 

F D YD 



■// 



Motif B 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



hPQG pS hh 
. NGKFYKQTKGIPQGLCVSSILSSFYYA . 
. GNSQYLQKVGIPQGS ILSSFLCHFYME . 
. EDKCYIREDGLFQGSSLSAPIVDLVYD . 
. RATSYVQCQGIPQGSILSTLLCSLCYG . 
G QG S 



Motif C 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



Y 

h F DD hhh 
. PNVNLLMRLTDDYLLITTQENN . 
. KKGSVLLRWDDFLFITVNKKD . 
. SQDTL I LKLADDFL I I STDQQQ . 
. RRDGLLLRLVDDFLLVTPHLTH . 
DD L 



Motif D 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



Gh h cK 
.NVSRENGFKFNMKKL. . 
. LNLSLRGFEKHNFST . . 
. KKLAMGGFQKYNAKA . . 
. LRTL VRGVP E YGC VV . . 

G 
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 

251 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 

3 01 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 

3 51 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 

4 01 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
4 51 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 
651 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
8 01 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 
851 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAG C C AC 
901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 
951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 

10 01 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 

1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 

1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 

1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 

12 01 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 

12 51 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 

13 01 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 

13 51 TGTGGCGGCC CC CGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 

14 01 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 

14 51 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 

15 01 AACGCCGCTT CCTCAGGAAC AC CAAGAAGT TCATCTCCCT GGGGAAG CAT 

15 51 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 

16 01 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 

16 51 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 

17 01 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGAC CAC 
1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 

18 01 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 

18 51 CTGTCGGAAG CAGAGGT C AG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 

19 01 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
19 51 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 
2 0 01 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
2 0 51 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
2151 GACCCGCCGC CTGAGCTGTA CTTTGT CAAG GTGGATGTGA CGGGCGCGTA 
2 2 01 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGC AT CAT C A 
22 51 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT C C AGAAGG C C 
2 3 01 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
2 3 51 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA 
24 01 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
24 51 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC 
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2 5 01 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 

2551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 

26 01 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 

2 651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 

2 7 01 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 

2 7 51 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC 

2 8 01 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 
H~ 2 851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC 
53 2 901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 
Q 2 951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 
m 3 0 01 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
1*1 3 051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
ri 3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 

- 3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGC CAAG 

ffl 3 2 01 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 

19 3 2 51 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 

( ' 3 3 01 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG 

pt 3 3 51 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC 

% 34 01 CGCAGCCAAC CCGGCACTGC CCTCAGACTT C AAGAC CAT C CTGGACTGAT 

T 34 51 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 

I 8 * 3 501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC 

(0 3 5 51 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 

PI 3 6 01 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAG CGAGT GTCCAGCCAA 

JH 3 6 51 GGG CTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 

v ™ 3 7 01 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CAC CAGGAGC CCGGCTTCCA 

3 7 51 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3 8 01 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
3 8 51 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG 
3 9 01 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT 
3 951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AAT AT AT GAG TTTTTCAGTT 
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 

I + + + + + + 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 



AALRPAAHVGSPGPGHPRDA 
QRCVLLRTWEALAPATPAMP 
SAASCCARGKPWPRPPPRCR 



GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 

, + + + + + 

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 



ARSPLPSRALPAAQPLPRGA 
RAPRCRAVRSLLRSHYREVL- 
AL PAAE PCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 

121 + + + + + + 180 

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

AAGHVRAAPGAPGLAAGAAR 
PLATFVRRLGPQGWRLVQRG 
RWPRSCGAWGPRAGGWCSAG- 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCANTGCNTGGTGTGCGTGCCCTGGGANGN 

181 + + + + + + 240 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

GPGGFPR?GGP??GVRALG? 
DPAAFRA?VA?C?VCVPW??~ 
TRRLSARWWP?AWCACPG? ? - 



ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

, + + + + + 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG 



^AAPRRPLLPPGVLPE??GG 
•? "PPPAAPSFRQVSCL? ?LVA- 
G^PPPPPPSARCPA*??WWP- 



CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAANAACGTGCTGGCCTTCGGCTTCGC 

, + + + + + 

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG 



P S AA ? AVR ? RRE ? RAGLRL R 
R VL^ ?LC?RGA?NVLAFGFA- 
ECC^ ?CA?AAR?TCWPSASR- 



GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 

, + + + + + 

CGACGACCTGCCGCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 



AAGRGPRGPPRGLHHQRAQL 
LLDGARGGPPEAFTTSVRSY- 
CWTGPAGAPPRPS PPACAAT- 



CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

+ + + + + + 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 



PAQHGDRRTAGERGVGAAAA 
LPNTVTDALRGSGAWGLLLR- 
CPTR* PTHCGGAGRGGCCCA- 
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a PRGRRRAGSPAGTLR? ?CAG 

b RVGDDVLVHLLARCA? FVLV- 

C AWATTCWFTCWHAAR ? LCWW- 

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 

541 + + + + + + 600 

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG 

a GSQLRLP ?VRAAAVPARRC ? 

b ?PSCAY?VCGPPLYQLGAAT- 

c ?PAAPT?CAGRRCTSSAL?L- 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT 

6 01 + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

a SGPAPATR7WTR? RLGSNGP 

b QARPPPHA?GPE?VWDPTGL- 

c RPGPRHTL ?DP?ASGI QRAW- 

» GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 

^ 661 + + + + 4- + 720 

P CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

m a GT IASGRPGS PWAAS PGCEE 

;*: b EP*RQGGRGPPGLPAPGARR- 

c NHS VREAGVPLGCQ PRVRGG- 

m GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 

:fj 7 2i + + + + + + 780 

CGCGCCCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

5 

ri a ARGQCQPKSAVAQEAQAWRC 

b RGGSASRSLPLPKRPRRGAA- 

;P C AGAVPAEVCRCPRGPGVALP- 

m CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

781 + + + + + + 840 

l:t GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

III 

a P*AGADARWAGVLGPPGQDA 

b PEPERTPVGQGSWAHPGRTP- 

c LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 
841 + + + + + + 900 

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTGGGTG 

a W T E * PWFLCGVTCQTRRRSH 

b GPSDRGFCVVSPARPAEEAT- 

C DRVTVVSVWCHLPDPPK KPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 
901 + + + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCACCCGGCGGTCGT 

a LFGGCALWHAPLPPIRGPPA 

b SLEGALSGTRHSHPSVGRQH 

C LWRVRSLARATPTHPWAAST- 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACACGCCTTGTCCCCCG 
961 + + + + + + 1020 

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
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PRGPPIHIAATTSWDTPCPP 
HAGPPSTSRPPRPGTRLVPR- 
TRAPHPHRGHHVLGHALSPG- 

GTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGNACACTGCGNCCCTC 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTCCGCTGTTCNTGTGACGCNGGGAG 

VYAETKHFLYSSGDK? TA? L 
CTPRPSTSSTPQAT7TLRPS- 
VRRDQALPLLLRRQ?HC?PP- 

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTTCGTGGAGACA 

1081 + + + + + + 1140 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

LPTQYI *GPA*LAFGRFVET 
FLLNISEAQPDWRSGGSWR? - 
SYS IYLRPSLTGVREVRGD?- 

NTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCTGCCCCA 

1141 + + + + + + 1200 

NAGAAAGACCAAGGT C CGGAAC CTACGGTC CTAAGGGGCGTCCAACGGGG CGGACGGGGT 

? FLVPGLGCQDS PQVAPPAP 
SFWFQALDARIPRRLPRLPQ- 
LSGSRPWMPGFPAGCPACPS- 

GCGNTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCC 

1201 + + + + + + 1260 

CGCNATGACCGTTTACGCCGGGGACAAAGACCTCGACGAACCCTTGGTGCGCGTCACGGG 

A ? LANAAPVSGAAWE PRAVP 
RYWQMRPLFLELLGNHAQC P 
? TGKCGPCFWS CLGTTRSAP- 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG 

1261 + + + + + + 1320 

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC 

LRGVPQDALPAASCGHPSSR 
YGVFLKTHCPLRAAVTPAAG- 
TGCS SRRTARCE LRS PQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG 

1321 + + + + + + 1380 

ACAGACACGGGCCCTCTTCGGGGTCCCGAGACACCGCCGGGGGCTCCTCCTCCTTGTGTC 

CLCPGEAPGLCGGPRGGGTQ 
VCAREKPQGSVAAPEEEEHR- 
SVPGRS PRALWRP PRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 

1381 + + + + + + 1440 

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA 

TPVAWCSCSASTAAPGRCTA 
PPSPGAAAPPAQQPLAGVRL 
PRRLVQLLRQHS S PWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG 

1441 + + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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a SCGPACAGWCPQASGAPGTT 

b RAGLPAPAGAPRPLGLQAQR- 

c VRACLRRLVP PGLWGS RHNE- 

AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCT 

1501 + + + + + + 1560 

TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGGACCCCTTCGTACGGTTCGAGA 

a NAASSGTPRSSSPWGSMPSS 

b TPLPQEHQEVHLPGEACQAL- 

c RRFLRNTKKF I SLGKHAKLS- 

CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGCTGCGCAGGAGCC 

1561 + + + + + + 1620 

GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 

a RCRS *RGR*ACGTALGCAGA 

b AAGADVEDERAGLRLAAQEP 

C LQELTWKMSVRDCAWLRRSP- 

CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGT 

1621 + + + + + + 1680 

U GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 

a QGLAVFRPQSTVCVRRSWPS 
U l b RGWLCSGRRAPSA*GDPGQV- 

W c GVGCVPAAEHRLREE I LAKF- 



TCCTGCACTGGCTGATGAGTGTGTACGTCGTGGAGCTGCTCAGGTCTTTCTTTTATGTCA 

1681 + + + + + + 1740 

AGGACGTGACCGACTACTCACACATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT 



a SCTG**VCTSSSCSGLSFMS 
'2 b PALADECVRRRAAQVFLLCH 

c LHWLMSVYVVELLRS FFYVT- 

j f\ CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGT 

J;!; 1741 + + + + + + 1800 

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGATGGCCTTCTCACAGACCTCGTTCA 

jj s s 

a RRPRFKRTGSFSTGRVSGAS 

b GDHVSKEQALFLPEECLEQV- 

c ETTFQKNRLFFYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAG 

1801 + + + + + + 1860 

ACGTTTCGTAACCTTAGTCTGTCGTGAACTTCTCCCACGTCGACGCCCTCGACAGCCTTC 

a CKALESDST*RGCS CGSCRK 

b AKHWNQTALEEGAAAGAVGS 
c QSIGIRQHLKRVQLRELSEA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 + + + + + + 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

a QRSGSIGKPGPPC*RPDSAS 

b RGQAASGSQARPADVQTPLH- 

c EVRQHREARPALLTSRLRF I - 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA 

1921 + + + + + + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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a SPSLTGCGRL*"TWTTSWEPE 

b PQA*RAAADCEHGLRRGSQN- 

C PKE?DGLRP IVNMDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCG 

1981 + + + + + + 2040 

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCCACTTCCGTGACAAGTCGC 

a RSAEKRGPSVSPRG*RHCSA 

b VPQRKEGRASHLEGEGTVQR 

C FRREKRAERLTS RVKAL F S V - 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGG 

2041 + + + + + + 2100 

ACGAGTTGATGCTCG CCCGCGC CGCGGGGC CGGAGGACCCG CGGAGACACGACCCGGACC 

a CSTTSGRGAPASWAPLCWAW 
b AQLR AGAAPRPPGRLCAGPG 

C LNYERARRPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + + + + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

a TISTGPGAPSCCVCGPRTRR 
b RYPQGLAHLRAACAGPGPAA 
C DIHRAWRTFVLRVRAQDPPP- 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACA 

2161 + + + + + + 2220 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 

a LSCTLSRWM*RARTTPSPRT 

b *AVLCQGGCDGRVRHHPPGQ- 

C ELYFVKVDVTGAYDT I PQDR- 

GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGT 

2221 + + + + + + 2280 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTTGGGGTCTTGTGCATGACGCACGCAGCCA 

a GSRRSSPAS SNPRTRTACVG 

b AHGGHRQHHQTPEHVLRASV- 

C LTEVIAS I IKPQNTYCVRRY- 

ATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT 

2281 + + + + + + 2340 

TACGGCACCAGGTCTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA 

a MPWSRRPPMGTSARPSRATS 

b CRGPEGRPWARPQGLQEPRL- 

c AVVQKAAHGHVRKAFKSHVS- 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA 

2341 + + + + + + 2400 

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTCAAGCAC CGAGTGGACGT C CTNTTGT 

a LP*QTSSRTCDSSWLTCR?T 

b YLDRPPAVHATVRGSPAG ?Q- 

c TLTDLQPYMRQFVAHLQ7NS- 

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG 

2401 + + + + + + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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a AR*GMPSSSSRAPP*MRPAV 
b PAEGCRRHRAELLPE * G Q Q W 

c PLRDAVVIEQSSSLNEASSG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGT 

2461 + + + + + + 2520 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA 

a ASSTSSYASCATTPCASGAS 

b PLRRLPTLHVPPRRAHQGQV- 

C LFDVFLRFMCHHAVRI RGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCC 

2521 + + + + + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG 

a PTSSARGSRRAPSSPRCSAA 

b LRPVPGDPAGLHPLHAALQP- 

C YVQCQGI PQGS ILSTLLCSL- 

j A TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCC 

1 -ss. 2581 + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

HI a CATATWRTSCLRGFGGTGCS 

i\\ b VLRRHGEQAVCGDSAGRAAP- 

c CYGDMENKLFAGIRRDGLLL- 

HI TGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCC 

ifk 2641 + + + + + + 2700 

' ACGCAAACCACCTACTAAAGAACAACCACTGTGGAGTGGAGTGGGTGCGCTTTTGGAAGG 

1 3 a CVWWMISCW*HLTSPTRKPS 

J;f b AFGG*FLVGDTSPHPRENLP- 

= ; 7 C RLVDDFLLVTPHLTHAKTFL- 



T C AGGAC CCTGGT CCGAGGTGT CC CTGAGT ATGGCTGCGTGGTGAACTTGCGGAAGACAG 

p 2701 + + + + + + 2760 

171 AGTCCTGGGACCAGGCTCCACAGGGACTCATACCGACGCACCACTTGAACGCCTTCTGTC 

i y 

a SGPWSEVSLSMAAW*TCGRQ 

b QDPGPRCP*VWLRGELAEDS 

C RT LVRGV P EYGCVVNLRKTV- 

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGG 

2761 + + + + + + 2820 

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTACGGCC 

a W* T S L * KTRPWVARLLFRCR 

b GELPCRRRGPGWHGFCSDAG- 

C VNFPVEDEALGGTAFVQMPA- 

CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA 

2821 + + + + + + 2880 

GGGTGC CGGATAAGGGGACCACGCCGGACGACGAC C TATGGGC CTGGGAC CT CCACGT CT 

a PTAYSPGAACCWIPGPWRCR 

b PRPIPLVRPAAGYPDPGGAE- 

c HGLFPWCGLLLDTRTLEVQS- 

GCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCT 

2881 + + + + + + 2940 

CG CTGATGAGGTCGATACGGGC CTGGAGGT AGTCT CGGTCAGAGTGGAAGTTGGCGC CGA 
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a ATTPAMPGPPSEPVSPSTAA 

b RLLQLCPDLHQSQSHLQPRL- 

c DYS SYARTS IRASLTFNRGF- 

TCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACA 

2941 + + + + + + 3000 

AGTTCCGACCCTCCTTGTACGCAGCGTTTGAGAAACCCCAGAACGCCGACTTCACAGTGT 

a SRLGGTCVANSLGSCG* SVT 

b QGWEEHASQTLWGLAAEVSQ- 

C KAGRNMRRKL FGVLRLKCHS- 

GCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCTACAAGA 

3001 + + + + + + 3060 

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT 

a ACFWI C R * TASRRCAPTSTR 

b PVSGFAGEQPPDGVHQHLQD 
c LFLDLQVNSLQTVCTNIYKI- 

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGC 

iZ 3061 + + + + + + 3120 

} J AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

m a SSCCRRTGFTHVCCSSHFIS 

"| b PPAAGVQVSRMCAAAPISSA- 

C LLLQAYRFHACVLQL PFHQQ- 

Sl 

||1 AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATGTCTGACACGGCCTCCCTCTGCT 

:U 3121 + + + + + + 3180 

TTCAAACCTTCTTGGGGTGTAAAAAGGACGCGCAGTAGAGACTGTGCCGGAGGGAGACGA 

|3 a KFGRTPHFSCASSLTRPPSA 

ji b SLEEPHI FPARHL* HGLPLL 

l\ c VWKNPTFFLRVISDTASLCY- 



3 ass 



AC TCCAT C CTGAAAGC C AAGAACGCAGGGATGT CGCTGGGGGCCAAGGGCGCCGCCGGCC 

3181 + + + + + + 3240 

TGAGGTAGGACTTTCGGTTCTTGCGTCCCTACAGCGACCCCCGGTTCCCGCGGCGGCCGG 

a TPS * KPRTQGCRWGPRAPPA 

b LHPESQERRDVAGGQGRRRP- 

c SILKAKNAGMSLGAKGAAGP- 

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC 

3241 + + + + + + 3300 

GAGACGGGAGGCT CCGGC ACGTCAC CGACACGGTGGTTCGT AAGG ACGAGTTCGACTGAG 

a LCPPRPCSGCATKHSCSS*L 

b SALRGRAVAVPP S I PAQAD S - 

C LPSEAVQWLCHQAFLLKLTR- 

GACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA 

3301 + + + + + + 3360 

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

a DTVSPTCHSWGHSGQPRRS* 

b TPCHLRATPGVTQDS PDAAE 

c HRVTYVPLLGSLRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC 

3361 + + + + + + 3420 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 
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a VGSSRGRR* LPWRPQPTRHC 

b SEAPGDDADCPGGRSQPGTA- 

c RKLPGTTLTALEAAANPALP- 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCA 

3421 + + + + + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

a PQTSRPSWTDGHPPTARPRA 

b LRLQDHPGLMATRPQPGREQ- 

c SDFKTILD*WPPAHSQAESR- 

GACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 

3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

a DTSSPVTPGSTSQGGRGGPH 
b TPAALSRRALRPREGGAAHT 
c HQQPCHAGLYVPGREGRPTP- 

I f CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGT 

p 3541 + + + + + + 3600 

GGT C CGGGCGTGGCGACCCTCAGAC TCCGGACTCAC T CACAAACCGGCTCCGGACGTACA 

t/l a PGPHRWESEA*VSVWPRPAC 

hi b QARTAGSLRPE*VFGRGLHV- 

Sj c RPAPLGV*GLSECLAEACMS- 

J£J C C GGCTGAAGGCTGAGTGTC CGGCTGAGGCCTGAGCGAGTGTC CAGCCAAGGGCTGAGTG 

U2 3601 + + + + + + 3660 

3 GGCCGACTTCCGACTCACAGGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACTCAC 

fl 

n Z a PAEG*VSG*GLSECPAKG*V 

*h b RLKAECPAEA*ASVQPRAEC 

|«a c G* RLSVRLRPERVS SQGLSV- 

m 

]-Z TCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCC 

U 3661 + + + + + + 3720 

1 1| AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

a S STPAVFTS PQAGARLHPRA 

b PAHLPSSLP HRLALGSTPGP 

C QHTCRLHFPTGWRSAPPQGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA 

3721 + - + + + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

a SFSSPGARLPLPT*E*SIPR 

b AFPHQEPGFHSPHRNSPSPD- 

c LFLTRSPASTPHIGIVHPQI- 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTG 

3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW- 

C RHCS PLALPS FAFHPHH PGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTGTGGGAATTTGGAGTGACCAAAGGTGTGCCCTG 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC 
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a ETLRRTLGALGIWSDQRCAL 

b RP *EGPWELWEFGVTKGVPC 

c DPEKDPGSSGNLE*PKVCPV- 

I ^ TACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGT 

I« 3901 + + + + + + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 

fsj a YTGEDPAPGWGSLWVKLGGG 

™? b TQARTLHLDGGPCGSNWGEV- 

y C HRRGPCTWMGVPVGQ I GGRC- 

5 "is 

?v s 

1 13 GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAA 

? 



3961 + + + + + + 4020 

TTTTTTTT 



4;:: a A V G V K Y * IYEFFSFEKKKKK 

L.^ b LWE*NTEYMSFSVLKKKKKK- 

c CGSKILNI *VFQF*KKKKKK~ 

H r B? 

O AAAAAAAAA 

jjj 4021 4029 



a K K K 

b K K 

c K K 



FIG. 51 

(CONTINUED) 



+ 



+ 72/78 




73/78 



i 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 

CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 

his tyr arg glu val leu pro leu ala thr phe val arg arg leu 

CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 

GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 

phe arg ala leu val ala gin cys leu val cys val pro trp asp 

TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 

GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 

leu lys glu leu val ala arg val leu gin arg leu cys glu arg 

CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 

GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 

ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 

GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 

TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 

trp gly leu leu leu arg arg val gly asp asp val leu val his 

TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 

CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 

ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 

GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 

ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gin gly ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 

280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 

290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tyr ser ser gly asp lys glu gin leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 



380 



390 



arg arg leu pro arg leu pro gin arg tyr trp gin met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr qly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 



410 



420 



val leu leu lys thr his cys pro leu arg ala ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 
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ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ala lys phe 
GCC AAG TTC 



leu arg ser 
CTC AGG TCT 



arg leu phe 
AGG CTC TTT 



ile gly ile 
ATT GGA ATC 

ser glu ala 
TCG GAA GCA 



leu thr ser 
CTG ACG TCC 



pro ile val 
CCG ATT GTG 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



leu his trp 
CTG CAC TGG 

560 

phe phe tyr 
TTC TTT TAT 



phe tyr arg 
TTC TAC CGG 

590 

arg gin his 
AGA CAG CAC 

glu val arg 
GAG GTC AGG 

620 

arg leu arg 
AGA CTC CGC 



asn met asp 
AAC ATG GAC 



arg 
CGG 


glu 
GAG 


lys 
AAG 


thr 
ACA 


asp 
GAC 


pro 
CCC 


pro 
CCC 


trp 
TGG 


4 b U 

gin 
CAG 


val 
GTG 


pro 
CCC 


pro 
CCA 


leu 
CTC 


arg 
AGG 


490 
asn 
AAC 


leu 
CTC 


ser 
TCG 


leu 
CTG 


ala 
GCT 


trp 
TGG 


520 
leu 
CTG 


glu 

GAG 


his 
CAC 


arg 
CGT 


leu 
CTG 


met 
ATG 


550 
ser 
AGT 


val 
GTC 


thr 
ACG 


glu 
GAG 


lys 
AAG 


ser 
AGT 


580 
val 
GTC 


leu 
TTG 

gin 
CAG 


lys 
AAG 

his 
CAT 


arg 
AGG 
610 
arg 
CGG 


phe 
TTC 


ile 
ATC 


pro 
CCC 


tyr 
TAC 


val 
GTC 


640 
val 
GTG 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



val tyr val 
GTG TAC GTC 



thr thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 



val gin leu 
GTG CAG CTG 

glu ala arg 
GAA GCC AGG 



lys pro asp 
AAG CCT GAC 



gly ala arg 
GGA GCC AGA 



ser val ala 
TCT GTG GCG 

450 
val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 
gly ser arg 
GGC TCC AGG 



phe ile ser 
TTC ATC TCC 

510 

thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 

glu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 

570 

gin lys asn 
CAA AAG AAC 



leu gin ser 
TTG CAA AGC 

600 

arg glu leu 
CGG GAG CTG 

pro ala leu 
CCC GCC CTG 

630 

gly leu arg 
GGG CTG CGG 



thr phe arg 
ACG TTC CGC 
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arg glu lys 
AGA GAA AAG 



phe ser val 
TTC AGC GTG 



gly ala ser 
GGC GCC TCT 



thr phe val 
ACC TTC GTG 



tyr phe val 
TAC TTT GTC 



gin asp arg 
CAG GAC AGG 



asn thr tyr 
AAC ACG TAC 



his gly his 
CAT GGG CAC 



thr asp leu 
ACA GAC CTC 



glu thr ser 
GAG ACC AGC 



ser leu asn 
TCC CTG AAT 



phe met cys 
TTC ATG TGC 



gin cys gin 
CAG TGC CAG 



cys ser leu 
TGC AGC CTG 



ile arg arg 
ATT CGG CGG 



;5o 

arg ala glu 
AGG GCC GAG 



leu asn tyr 
CTC AAC TAC 

680 

val leu gly 
GTG CTG GGC 



leu arg val 
CTG CGT GTG 

710 

lys val asp 
AAG GTG GAT 



leu thr glu 
CTC ACG GAG 

740 

cys val arg 
TGC GTG CGT 



val arg lys 
GTC CGC AAG 

770 

gin pro tyr 
CAG CCG TAC 



pro leu arg 
CCG CTG AGG 

800 

glu ala ser 
GAG GCC AGC 



his his ala 
CAC CAC GCC 

830 

gly ile pro 
GGG ATC CCG 



cys tyr gly 
TGC TAC GGC 

860 

asp gly leu 
GAC GGG CTG 
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arg leu thr 
CGT CTC ACC 

670 

glu arg ala 
GAG CGG GCG 



leu asp asp 
CTG GAC GAT 

700 

arg ala gin 
CGG GCC CAG 



val thr gly 
GTG ACG GGC 

730 

val ile ala 
GTC ATC GCC 



arg tyr ala 
CGG TAT GCC 

760 

ala phe lys 
GCC TTC AAG 



met arg gin 
ATG CGA CAG 

790 

asp ala val 
GAT GCC GTC 



ser gly leu 
AGT GGC CTC 

820 

val arg ile 
GTG CGC ATC 



gin gly ser 
CAG GGC TCC 

850 

asp met glu 
GAC ATG GAG 



leu leu arg 
CTC CTG CGT 



ser arg val 
TCG AGG GTG 



arg arg pro 
CGG CGC CCC 



ile his arg 
ATC CAC AGG 



asp pro pro 
GAC CCG CCG 



ala tyr asp 
GCG TAC GAC 



ser ile ile 
AGC ATC ATC 



val val gin 
GTG GTC CAG 



ser his val 
AGC CAC GTC 



phe val ala 
TTC GTG GCT 



val ile glu 
GTC ATC GAG 



phe asp val 
TTC GAC GTC 



arg gly lys 
AGG GGC AAG 



ile leu ser 
ATC CTC TCC 



asn lys leu 
AAC AAG CTG 



leu val asp 
TTG GTG GAT 



660 

lys ala leu 
AAG GCA CTG 



gly leu leu 
GGC CTC CTG 

690 

ala trp arg 
GCC TGG CGC 



pro glu leu 
CCT GAG CTG 

720 

thr ile pro 
ACC ATC CCC 



lys pro gin 
AAA CCC CAG 

750 

lys ala ala 
AAG GCC GCC 



ser thr leu 
TCT ACC TTG 

780 

his leu gin 
CAC CTG CAG 



gin ser ser 
CAG AGC TCC 

810 

phe leu arg 
TTC CTA CGC 



ser tyr val 
TCC TAC GTC 

840 

thr leu leu 
ACG CTG CTC 



phe ala gly 
TTT GCG GGG 

870 

asp phe leu 
GAT TTC TTG 
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880 

leu val thr pro his leu thr his ala lys thr phe leu arg thr 
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 
leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 

lys thr val val asn phe pro val glu asp glu ala leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 

920 930 
thr ala phe val gin met pro ala his gly leu phe pro trp cys 
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 

940 

gly leu leu leu asp thr arg thr leu glu val gin ser asp tyr 
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 960 
ser ser tyr ala arg thr ser ile arg ala ser leu thr phe asn 
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC 

970 

arg gly phe lys ala gly arg asn met arg arg lys leu phe gly 
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 990 
val leu arg leu lys cys his ser leu phe leu asp leu gin val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 10 20 
leu gin ala tyr arg phe his ala cys val leu gin leu pro phe 
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his gin gin val trp lys asn pro thr phe phe leu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 105 0 
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala gly met ser leu gly ala lys gly ala ala gly pro leu pro 
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

107 0 1080 
ser glu ala val gin trp leu cys his gin ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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iioo mo 

arg thr ala gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 
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